Speech-driven Language-barrier-free Multi-lingual Information Access
نویسنده
چکیده
While the number of machine readable information accessible via the World Wide Web is growing, the digital divide problem caused by various barriers has of late become crucial. This paper describes research examples and projects of information access aimed to overcome barriers associated with human languages.
منابع مشابه
Multi-lingual and Multi-modal Speech Processing and Applications
Over the last decade voice technologies for telephony and embedded solutions became much more mature, resulting in applications providing mobile access to digital information from anywhere. Both a growing demand for voice driven applications in many languages and the need for improved usability and user experience now drives the exploration of multi-lingual speech processing techniques for reco...
متن کاملDifferent Approaches to Build Multilingual Conversational Systems
The paper describes developments and results of the work being carried out during the European research project CATCH-2004 (Converse in AThens Cologne and Helsinki). The objective of the project is multi-modal, multi-lingual conversational access to information systems. This paper concentrates on issues of the multilingual telephony-based speech and natural language understanding components.
متن کاملMulti-lingual phoneme recognition exploiting acoustic-phonetic similarities of sounds
The aim of this work is to exploit the acoustic-phonetic similarities between several languages. In recent work cross{ language HMM-based phoneme models have been used only for bootstrapping the language{dependent models and the multi{lingual approach has been investigated only on very small speech corpora. In this paper, we introduce a statistical distance measure to determine the similarities...
متن کاملMulti-lingual concatenative speech synthesis
This paper describes a method of concatenative speech synthesis that makes use of 3-dimensional labelling of speech, and shows how this can be applied to the synthesis of both mono-lingual and foreign-language speech. The dimensions encode phonetic, prosodic, and voicequality information in order to fully describe the acoustic characteristics of each speech segment.
متن کاملCross Lingual Modelling Experiments for Indonesian
The extension of Large Vocabulary Continuous Speech Recognition (LVCSR) to resource poor languages such as Indonesian is hindered by the lack of transcribed acoustic data and appropriate pronunciation lexicons. Research has generally been directed toward establishing robust cross-lingual acoustic models, with the assumption that phonetic lexicons are readily available. This is not the case for ...
متن کامل